    Monolingual and bilingual spanish-catalan speech recognizers developed from SpeechDat databases

    Under the SpeechDat specifications, the Spanish member of the SpeechDat consortium has recorded a Catalan database that includes one thousand speakers. This communication describes experimental work carried out using both the Spanish and the Catalan speech material. A speech recognition system was trained for the Spanish language using a selection of the phonetically balanced utterances from the 4500 SpeechDat training sessions. Utterances with mispronounced or incomplete words, or with intermittent noise, were discarded. A set of 26 allophones was selected to account for the Spanish sounds, and clustered demiphones were used as context-dependent sub-lexical units. Following the same methodology, a recognition system was trained from the Catalan SpeechDat database, with the Catalan sounds described by 32 allophones. Additionally, a bilingual recognition system was built for both the Spanish and Catalan languages. By means of clustering techniques, a suitable set of allophones covering both languages simultaneously was determined; 33 allophones were selected. The training material comprised the whole Catalan training material and the Spanish material coming from the Eastern region of Spain (the region where Catalan is spoken). The performance of the Spanish, Catalan and bilingual systems was assessed under the same framework. The Spanish system exhibits a significantly better performance than the other systems due to its larger amount of training data. The bilingual system provides performance equivalent to that of the language-specific systems trained with the Eastern Spanish material or the Catalan SpeechDat corpus. Peer reviewed. Postprint (published version).

    SVMs for Automatic Speech Recognition: a Survey

    Hidden Markov Models (HMMs) are, undoubtedly, the most widely employed core technique for Automatic Speech Recognition (ASR). Nevertheless, we are still far from achieving high-performance ASR systems. Some alternative approaches, most of them based on Artificial Neural Networks (ANNs), were proposed during the late eighties and early nineties. Some of them tackled the ASR problem using predictive ANNs, while others proposed hybrid HMM/ANN systems. However, despite some achievements, the preponderance of Markov models remains a fact today. During the last decade, however, a new tool appeared in the field of machine learning that has proved able to cope with hard classification problems in several fields of application: the Support Vector Machine (SVM). SVMs are effective discriminative classifiers with several outstanding characteristics, namely: their solution is the one with maximum margin; they are capable of dealing with samples of very high dimensionality; and their convergence to the minimum of the associated cost function is guaranteed. These characteristics have made SVMs very popular and successful. In this chapter we discuss their strengths and weaknesses in the ASR context and review the current state-of-the-art techniques. We organize the contributions in two parts: isolated-word recognition and continuous speech recognition. Within the first part we review several techniques to produce the fixed-dimension vectors needed by original SVMs. Afterwards we explore more sophisticated techniques based on kernels capable of dealing with sequences of different lengths. Among them is the DTAK kernel, simple and effective, which rescues an old speech recognition technique: Dynamic Time Warping (DTW). Within the second part, we describe some recent approaches to tackling more complex tasks, such as connected digit recognition or continuous speech recognition, using SVMs. Finally we draw some conclusions and outline several ongoing lines of research.
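    Since the DTAK kernel mentioned above builds on Dynamic Time Warping, a minimal sketch of classic DTW between two scalar sequences may be helpful (the function name and the absolute-difference local cost are illustrative assumptions; real ASR front-ends align sequences of cepstral feature vectors rather than scalars):

    ```python
    import numpy as np

    def dtw_distance(x, y):
        """Classic dynamic time warping: minimum accumulated cost over all
        monotonic alignments of sequence x against sequence y."""
        n, m = len(x), len(y)
        D = np.full((n + 1, m + 1), np.inf)  # accumulated-cost matrix
        D[0, 0] = 0.0
        for i in range(1, n + 1):
            for j in range(1, m + 1):
                cost = abs(x[i - 1] - y[j - 1])  # local distance (illustrative)
                # extend the cheapest of the three admissible predecessor paths
                D[i, j] = cost + min(D[i - 1, j], D[i, j - 1], D[i - 1, j - 1])
        return D[n, m]
    ```

    Because the warping path may stretch or compress either sequence, two utterances of different lengths but identical shape obtain zero distance, which is exactly the length-invariance that the DTAK kernel exploits to compare variable-length inputs inside an SVM.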

    On frequency averaging for spectral analysis in speech recognition

    Many speech recognition systems use logarithmic filter-bank energies, or a linear transformation of them, to represent the speech signal. Usually, each of those energies is routinely computed as a weighted average of the periodogram samples that lie in the corresponding frequency band. In this work, we attempt to gain insight into the statistical properties of the frequency-averaged periodogram (FAP) from which those energies are sampled. We show that the FAP is statistically and asymptotically equivalent to a multiwindow estimator that arises from Thomson's optimization approach and uses orthogonal sinusoids as windows. The FAP and other multiwindow estimators are tested in a speech recognition application, observing the influence of several design factors. In particular, a technique that is as computationally simple as the FAP, and which is equivalent to using multiple cosine windows, emerges as an alternative worth considering. Peer reviewed.
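    The band-energy computation described in the abstract can be sketched as follows (a simplified illustration, assuming unweighted rectangular bands of equal width; actual front-ends typically use mel-spaced triangular weights, and the band layout here is not the paper's exact configuration):

    ```python
    import numpy as np

    def log_filterbank_energies(frame, n_bands=6):
        """Log filter-bank energies from a frequency-averaged periodogram (FAP):
        each band energy is the average of the periodogram samples in that band."""
        n = len(frame)
        # periodogram: squared magnitude of the real FFT, normalized by length
        periodogram = np.abs(np.fft.rfft(frame)) ** 2 / n
        # contiguous, equal-width frequency bands (illustrative simplification)
        bands = np.array_split(periodogram, n_bands)
        energies = np.array([band.mean() for band in bands])
        return np.log(energies + 1e-12)  # small floor avoids log(0)
    ```

    Averaging neighboring periodogram samples reduces the estimator's variance at the cost of frequency resolution, which is why the paper can relate the FAP to Thomson-style multiwindow spectral estimators.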
